Deep multimodal fusion for semantic image segmentation: A survey
نویسندگان
چکیده
Recent advances in deep learning have shown excellent performance various scene understanding tasks. However, some complex environments or under challenging conditions, it is necessary to employ multiple modalities that provide complementary information on the same scene. A variety of studies demonstrated multimodal fusion for semantic image segmentation achieves significant improvement. These approaches take benefits sources and generate an optimal joint prediction automatically. This paper describes essential background concepts relevant applications computer vision. In particular, we a systematic survey methodologies, datasets, quantitative evaluations benchmark datasets. Existing methods are summarized according common taxonomy: early fusion, late hybrid fusion. Based their performance, analyze strengths weaknesses different strategies. Current challenges design choices discussed, aiming reader with comprehensive heuristic view segmentation.
منابع مشابه
Towards Robust Semantic Segmentation using Deep Fusion
Robust semantic scene understanding of unstructured environments is critical for robots operating in the real world. Several inherent natural factors such as shadows, glare and snow make this problem highly challenging, especially using RGB images. In this paper, we propose the use of multispectral and multimodal images to increase robustness of segmentation in real-world outdoor environments. ...
متن کاملRobust Semantic Segmentation using Deep Fusion
Robust semantic scene understanding of unstructured environments is critical for robots operating in the real world. Several inherent natural factors such as shadows, glare and snow make this problem highly challenging, especially using RGB images. In this paper, we propose the use of multispectral and multimodal images to increase robustness of segmentation in real-world outdoor environments. ...
متن کاملSurvey on Multimodal Medical Image Fusion Techniques
An Image fusion is the development of amalgamating two or more image of common characteristic to form a single image which acquires all the essential features of original image. Nowadays lots of work is going to be done on the field of image fusion and also used in various application such as medical imaging and multi spectra sensor image fusing etc. For fusing the image various techniques has ...
متن کاملSeason-Invariant Semantic Segmentation with a Deep Multimodal Network
Semantic scene understanding is a useful capability for autonomous vehicles operating in off-roads. While cameras are the most common sensor used for semantic classification, the performance of methods using camera imagery may suffer when there is significant variation between the train and testing sets caused by illumination, weather, and seasonal variations. On the other hand, 3D information ...
متن کاملMultimodal medical image fusion based on Yager’s intuitionistic fuzzy sets
The objective of image fusion for medical images is to combine multiple images obtained from various sources into a single image suitable for better diagnosis. Most of the state-of-the-art image fusing technique is based on nonfuzzy sets, and the fused image so obtained lags with complementary information. Intuitionistic fuzzy sets (IFS) are determined to be more suitable for civilian, and medi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Image and Vision Computing
سال: 2021
ISSN: ['0262-8856', '1872-8138']
DOI: https://doi.org/10.1016/j.imavis.2020.104042